A class of punctured simplex codes which are 
proper for error detection 
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Abstract — Binary linear [n, k] codes that are proper for error 
detection are known for many combinations of n and k. For the 
remaining combinations, existence of proper codes is conjectured. 
In this paper, a particular class of [n, k] codes is studied in detail. 
In particular, it is shown that these codes are proper for many 
combinations of n and k which were previously unsettled. 

Index Terms — Error detection, proper codes, satisfactory 
codes, simplex codes, punctured codes, ugly codes. 



I. Introduction 

In this paper, we study binary linear [n, k] codes (codes 
of length n and dimension k) used for error detection on the 
binary symmetric channel. A comprehensive introduction to 
the field is given in flT). The basic definitions are given in 
Section [TT] A main quantity is the probability of undetected 
error of a code. If the probability of undetected error is an 
increasing function on the interval [0, 1/2], the code is known 
as proper for error detection. 

It is believed that proper codes exist for all lengths n and 
dimensions k. However, this has been shown only for some 
cases. In particular, proper [n, k] codes are known to exist for 
any given k when n is sufficiently large. The best known result 
in this direction was given by Kl0ve and Yari J2] who showed 
that proper codes exist for 



n > 2 



k-li 



2 L(fc-5)/2j\ when k>h _ 



(1) 



In this paper, we study a particular class of [n, k] codes 
where n > 2 k ~ 1 . One of our results is that these codes 
are proper for many values of n and k where the existence 
of proper codes was previously unknown. In particular, we 
improve the bound (Q]). 

We first consider n in the range 2 fc ~ 1 < n < 2 k . The 
Hamming bound proves that the dual of an [n, k] code in this 
case has minimum distance at most 3. Moreover, an [n, n—k] 
code with minimum distance 3 can be obtained by shortening 
the [2 k — 1, 2 k — 1 — k] Hamming code. 

Two [n,k] codes C and C are equivalent if there exists a 
permutation it of {1,2, ... ,n} such that 

C = {(cK(l),Cn(2),---,Cn(n)) I ( c l: c 2, • • • , Cn) € C) . 

If two codes are equivalent, then it may happen that the 
corresponding (repeatedly) punctured codes are not equivalent. 

Let Hk be some k x (2 k — 1) matrix having as columns 
all possible nonzero vectors of length k. The code generated 
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by Hk is the simplex code Sk, and the code having Hk as 
parity check matrix is the well-known Hamming code. Note 
that the order of the columns is not specified; all the equivalent 
codes are named Hamming codes. However, when we want to 
puncture the code, the order is very important. 

We remind the reader that puncturing a code is equivalent 
to shortening the dual code. Davydov et al. (3) determined 
an ordering of the columns in Hk such that any of the cor- 
responding (repeatedly) shortened codes contains a minimal 
number of codewords of weight three; the shortened [n, n — k] 
codes are obtained by removing 2 k — 1 — n columns from 
Hk to get a k x n matrix M n> k and use this matrix as the 
parity check matrix for the [n,n — k] code. They showed that 
a possible choice of M n< k is to have as columns the vectors 
that are the binary representation of the numbers from 2 k — 1 
down to 2 k — n. For example 

"11111111000 
11110000111 
11001100110 
10101010101 



Mi 



1,4 



(2) 



We let D nt k denote the code generated by M„ fc. For our 
investigation, we will consider codes that are equivalent (but 
not equal) to these codes; we will denote them by S n ,k- A 
main reason for considering S n ,k rather than D n ,k is that the 
determination of the weight distribution is easier for S n .k- 

In this paper, we investigate the performance of the codes 
S n ,k when they are used for error detection. We compute their 
weight distribution that, in turn, permits us to calculate the 
undetected error probability P ue (S n ,k,p)- However, when the 
code length n is large (n > 2 20 ), the polynomial expressing 
Puc(S n .k,p) may be difficult to evaluate, even when the weight 
distribution is known. For this reason, in the general case, 
we also find bounds on the length and dimension such that a 
necessary condition for codes to be satisfactory does not hold, 
using a method similar to the one proposed in |4|. 

The paper is organized as follows: in Section [II] we give 
some preliminaries on error detection; in Section [Til] we 
describe the construction of S n< k and show that S n .k and 
are equivalent; in Section [TV] we determine the weight 
distribution of S n ,k\ in Section[V]we study the undetected error 
probability of S n ,k and its dual code; in Section[vT]we give an 
asymptotic analysis; in Section IVIII we give a generalization 
of the construction to lenghts n > 2 k , finally, in Section [Villi 
we summarize our results. 

II. Error detection 

We start by defining P uc (C,p), the undetected error proba- 
bility for an [n, k] code C when used on the binary symmetric 



2 



channel with error probability p: 

n 



(3) 



where A w is the number of codewords having Hamming 
weight w, see e.g. fl] Section 2.1.2]. 

One can also express this polynomial in terms of the weight 
distribution of the dual code, see e.g. 0] Theorem 2.4]. If 
is the number of codewords having Hamming weight w in the 
dual code G , we have: 

n 

P uc (C,p) = 2 k ~ n J2 A ^ - 2 P) W ~ C 1 - PT- W 

As mentioned in the introduction, if P ue (C,p) is an increas- 
ing function on [0, |] , the code C is called proper for error 
detection. If 

Pue{C,p) < Pue(C,l/2) 

for every p £ [0, |] , C is called gooc/ for error detection. If 

P ue (C,p) < 2 k ~ n 

for every p 6 [0, i], C is called satisfactory for error 
detection, see (U p. 38]. A code that is not satisfactory is 
called ugly. When a code is proper then it is satisfactory; so, 
if it is ugly it is clearly not proper (nor good). 

III. The code construction 

We first describe a particular parity check matrix H k for the 
Hamming code. For < m < k — 1, let be the k x 2 m 

matrix constructed as follows: 

• The first k — m — 1 rows are all-zero vectors. 

• Row k — m is the all-one vector. 

• In the m x 2 m matrix consisting of the last m rows, the 
columns are ordered lexicographically. 

Then 



H 



(fc-1) | H (k-2) 



H, 



(0) 



We illustrate this with an example. For k = 4, we get 



H 



(o) 



"0" 




"00" 




"0000" 






rr(l) 
) "4 — 


00 

11 


, i?i 2) = 


1111 

0011 


1 




01 




0101 



H. 



and so 



11111111 
00001111 
00110011 
01010101 



Hf\Hf\H^\Hf ] 

111111110000000" 
000011111111000 
001100110011110 
010101010101011 



We let H k (n) denote the k x n matrix containing the first 
n columns of H k . For example 



#4(11) = 



11111111000 
00001111111 
00110011001 
01010101010 



(5) 



We let SVi.fc denote the code generated by Hk(n). We 
see that S 2 k-i k is the first order Reed-Muller code and 
2*— i, k = $k, the simplex code. Both of these codes are 



S- 



known to be proper (and this is easy to show). The Hamming 
_ 1 fc . The code having Hk(n) as parity check 



code is S^k 



matrix is a shortened Hamming code which we denote by 
C n ,n-k- We note that C„,„_fc = S^ k . In the rest of the paper 
(except Section [VTlb we will assume that 2 fc_1 < n < 2 k — 1. 
Theorem 1: The codes SVi.fc and D n ^ are equivalent. 
Proof: We first illustrate by the example k = 4 and n = 
11, that is, the matrices (0 and (fJJ. Adding the second row 
in (|5]l to the third and forth rows, we get 



11111111000 
00001111111 
00111100110 
01011010101 



(6) 



This is an alternative generator matrix for 511.4. The last three 
columns are the same in (0 and (O, and the first eight columns 
of (O are a permutation of the first eight columns in (O. 
Hence, 5x1,4 and 2?ii,4 are equivalent. 

In the general case, if n e [2 k - 2 fe -'» + 1, 2 k - 2 fe -" 1 - 1 - 1] 
for some m, 1 < m < k — 1, we add row m + 1 in Hk{n) 
to all the rows below. This gives an alternative generator 
matrix H' k (n) for S n .k- The first 2 fc ~ 1 columns of H' k (n) are a 
permutation of the binary representations of i E [2 fe_1 , 2 k — 1], 
the next 2 k ~ 2 columns of H' k (n) are a permutation of the 
binary representations of i € [2 fe ~ 2 ,2 fe ~ 1 — 1], etc. The final 
n — 2 k + 2 k ~ rn columns in H' k (n) and M n ,k are the same. 
Hence, S n ,k an d Dn,k are equivalent. 

If n = 2 k — 2 k ~ m for some m, 1 < m < k, the same 
argument shows that the columns of Hk{n) are a permutation 
of the columns of M n ^, and so again S n ,k and D n ^ are 
equivalent. ■ 

IV. Weight Distribution of S n .k 

The main question we consider is: for which n and k is S n ^ 
proper for error detection? We will also in some cases consider 
the simpler question: for which n and k is S n & satisfactory 
for error detection? 

We note that this is equivalent to the question: for which n 
and k is C„. n -k satisfactory for error detection? The reason 
is the following known lemma. 

Lemma 1: QQ Theorem 2.8]. A code is satisfactory if and 
only if the dual code is satisfactory. 

To determine the probability of undetected error for S n 
we have to determine its weight distribution. This is done in 
this section. We break the argument down into a number of 
lemmas. 
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We first give some further notations. We observe that the 
matrix 



H 



(fe-i) 



H 



(fc-2) 



H 



(k—m) 



has length 



V 2 k ~ j = 2 k - 2 k ~ r 

3=1 



For a given n, let m be determined by 

2& r)k—m <^ ^ <^ 2^' 2^ — m— i 

Since 2 fc ~ 1 < n < 2 fe , we have 1 < m < A; 
(ai, Q-2, ■ ■ ■ , ctk) denote the last column of Hk{n). 
Lemma 2: Let 2 k - 2 fc ~ m < n < 2 k - 2 k ~ m - 1 . 

ai = . . . = a m = 0, a m+ i = 1, 

and a m+ 2 , . . . , at are determined by 



fc — m — 2 

E 

i=0 



a fc _, 4 2* = ji - 1 - 2* 



- 1. Let 
Then 

(7) 
(8) 



Proof: The last column in -fffc(n) is a column in 
jj{k m i) jj ence f n ows Moreover, its number in 

fc— m— 1) 



is n - 1 - (2 k - 2 



k—r 



when we count the 



first column as number zero. The columns in m ~ 1 ^ are 
ordered lexicographically and so $Q follows. ■ 
Let Wi denote the weight of the i-th row in Hk{n). As usual, 
[x\ denotes the largest integer less than or equal to x. 
Lemma 3: Let 2 k - 2 k -' m < n < 2 k - 2 k ~ m - 1 . Then 



— ? 



and 



If 771 



If 771 



Wi = ■■■ = w m = 



w m+1 =n-2 k 1 + J 
2 < i < k and a^ = 0, then 

2 k-t 



fc-i 



fc— m— 1 



tl'i 



71 — 1 I 

2 fc -' i + 1 J' 



2 < i < k and a, = 1, then 

71—1 

Wi = n - 2" 



- 2 



k—i 



(9) 
(10) 

(11) 

_ 2 fc- i+1 j - ■ (12) 

Proof: All the rows of iJ^ have weight 2 k ~ 1 . The first m 
rows of Hk (n) are obtained from rows in removing some 
zeros. Hence Wi = 2 fc_1 for 1 < i < m. 

Before we go on with the proof, let us take a closer look 
at Hj, m \ Row i > m consists of consecutive blocks of zeros 
and ones, each block of length 2 k ~ % . We use the term double 
block for a zero-block combined with the following one-block; 
it has length 2 fc ~ 4+1 . Now, let 

U ~, , 2 fc ~ 4+1 + iy where 1 < v < 2 k -' l+1 . (13) 
2fc— *+i J — — 



Then row i in Hk(n) consists of 
length 2 fe ~ ' +1 , each of weight 2 fe ~ ' 



2 k-i+i double blocks of 

, followed by an incomplete 
double block of length v that has to be considered further 
(when v = 2 fc ~ l+1 , the incomplete double block is, of course, 
a full double block). 

If a, = 0, the incomplete double block is all zero, and so 
dTTb follows. 



If ai = 1, the incomplete double block consists of a full 
block (of length 2 k ~ l ) of zeros followed by an incomplete 
block of ones of length v — 2 k ~ l . Hence 



w. 



2 k ~ l 
2 k-t 



n—1 
2fc— i+i 
n — 1 
2fc— i+i 

n — 1 



■jk—i 



2 k— 1+1 



- 2 k ~ l 

71—1 

_2 k -' l + 

- 2 k ~ i 



_\ 2 k-i+i 



•ik—i 



This proves ( TTZb . The proof of ( fTOb is similar (and even 
simpler). ■ 
Lemma 4: Consider sums of rows from Hk{n). 

a) Any of the 2™ — 1 non-zero sums of some of the first m 
rows have weight 2 k ~ 1 . 

b) Any of the 2 m sums containing row m + 1 and zero or 
more previous rows have weight w rn+1 . 

c) For ill + 2 < i < fc, 2 l ~ 2 sums containing row i and 
some previous rows have weight Wi and the other 2*~ 2 sums 
have weight n — mj. 

Proof: For each sum of rows from the first m, the 
corresponding sum of rows in are codewords in the 
simplex code Sk- These always have weight 2 k ~ 1 . Since only 
zeros have been removed to get the corresponding rows in 
iJfc(n), their sum also has weight 2 fe_1 . This proves a). 
Let i > m + 2. We note that in the set of positions of a 

(k—i) 

double block in row i > m + 2 in HI for j < m + 
1, the elements of any previous row are all zero or all one. 
Therefore, the weight of these positions in any sum of row i 
and a combination of previous rows is 2 k ~ % . 

It remains to consider the contribution to the weight from 
the last v positions (where v is defined by (fT3]>). 

Case I) ai = 0. In this case, all the last v elements of row i 
are zeros. Any previous row has all zeros or all ones in these 
positions, and so the weight of the elements in these positions 
in any sum is either or v. Hence, the weight of the sum is 
either Wi or n — Wi, where Wi is given by (fTTT i. Moreover, 
row Tii + l has all ones in the last v positions. Hence, half of 
the 2 l ~ 1 sums has weight Wi and the other half has weight 

71 — Wi. 

Case II) ai = 1. In this case, all the last v elements of row 
i are 2 k ~ % zeros followed by v — 2 k ~ % ones. The weight of 
the last v elements in a sum is therefore v — 2 k ~ % or 2 fc- \ 
Hence, the weight of a sum is w% or n — Wi, where now Wi 
is given by (fT2l . As done above, considering sums containing 
row Tii + 1, we can see that the multiplicities of these two 
weights are the same. This proves c). 

Finally, consider row m + 1. Any previous row has all zeros 
in the last v positions. Hence, the weight of any sum involving 
row m + 1 and previous rows is w m+ \. This proves b). ■ 

We next give an alternative expression for m;. 



Lemma 5: Let 2 k — 2 



k — m 



< 71 



<^ rfk rfk—m—1 



a)lfm + 2<i<k and on = 0, then 



!<>; 



■yk-1 



)fc— m— 1 



i-1 

E 

j=m+2 



a 3 2 



k-i-j 
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b) lfm + 2<i<k and cti = 1, then 

i-l fc 

= 2 k - 1 -2 k - m - 1 + l+ "j 2 ' 1 ' ' E a J 2fe " 3 - 

j — m+2 J=i+1 

c) Further, 



iu m+ i = 2 fe ~ 1 - 2 fe -'"- 1 + 1 + ^ «j 2fc " J - 



j — m+2 



Proof: From (O we get 

n - 1 = 2 k ~ 2 k ~ m 



E 

j=m-\-2 



Hence 



n- 1 



yi — 1 c\i — m 



j=m+2 



where 



X ay 2 ! - 1 - J ' < 5] 2 i - 1 - J = 1 - 2 l - 1 - k < 1. 



Hence 



n- 1 



i-l 



2k-i+l 

and so, by ( fTTT i. 

n - 1 



j=m+2 



10,; = 2 



fc-i 



I nfc - 1 cik—tn—1 i \ ^ _ 1- 

i+rj = 2 ^ 2 + X a J 2 



j=m+2 



This proves a). 

Similarly, if a» = 1, ( fT2b gives 



to,- = n - 2 h 



n- 1 



~tk—i 



= l + 2 k - 2 k - 



53 a j 2 k - j -2 k - i 

j=m+2 

i-l 

_ 2 fc " 1 + 2 fc ^" 1 - 1 - 53 a j 2 k ~ 1 ~ J 

j=m+2 
i-l 

l + 2 k - 1 -2 k - rn - 1 + 53 "i 2fe ~ W 

j=m+2 

fc 

+ a, 2 fe "' + 53 a J 2k ~ j - 2 * _< 
j=i+i 

i-l 

l + 2 k ~ 1 -2 k - m - 1 + 53 o t 3 2k ~ 1 ~ j 

j=m-{-2 

fc 

+ E a i 2fe_i ' 



since Oj = 1. This proves b). Finally, c) follows directly by 
substituting the expression for n in the expression for w m +\ 
in ([TOll. ■ 

Example 1: Consider n = 2 k — 2 fe_m_1 , where 1 < m < k. 
We have 

cti = for 1 < i < m and cti = 1 for m + 1 < i < k. 

Using Lemma [5] we see that S n ,k has 2 m — 1 codewords of 
weight wi = 2' £ ~ 1 and 2 k — 2 m codewords of weight 

2 fe_l _ 2 fe_ m _ 2 = 

In particular, the minimum distance is n/2. Hence, the code 
S n ,k is proper (see (JJ Theorem 2.2]). o 
Lemma 6: Let 

2 fc _ 2 fc-m < n < 2 fc _ 2 fc-m-l 

and m + 2 < i < k. 

a) If a t = then w l+1 = w x . 

b) If a.i = and Qj + i = 1, then Wj+i > Wi. 

cl) If a, = 1, aj + i = 0, and <x, = 1 for all j such that 
? + 2 < j < fc, then = Wj. 

c2) If aj = 1, = 0, and ay = for at least one 

j > i + 2, then w i+ i > Wi. 

d) In all cases, 

W\ > W m +l > W k > Wk-1 > Wfc-2 > • ' ' > Wm+2- 

In particular, the minimum distance d of S n .k is w m +2- 

e) w m+ i > n/2. 

Proof: a) If cti = a»+i = 0, then Lemma [5] gives 

w i+ i -Wi = an 2 k ~ 1 ~ l = 0. 
If cti = ai+i = 1, then Lemma [5] gives 

ofc— i — 1 nfc — (i+1) n 

«;,-+! - Wi = cti 2 - « l +i 2 ' ^ ' = 0. 

b) If ccj = and c^+i = 1, then Lemma [5] gives 

fc 

w i+ i - Wi = i + 53 a J 2 ' c_: ' > °- 

j=i+2 

c) If cti — 1 an d cti+i = 0, then Lemma |5] gives 

fc 

Wi+i - Wi = 2 k - 1 - 1 - 1 - ^ aj 2 k -K 

We have 

fc fc 

53 otj 2 k - j < 53 2fe ^ = 2fc_1_1 - 1 

j=i+2 i=i+2 

with equality if and only if aj — 1 for i + 2 < j < k. 

d) We have 

fc fc 
53 aj 1 k ~' J < E 2fe_J = 2 fe " m_1 - 1, 

m+2 j— m+2 



and so 

Wm+l < 2^ 



fc — 1 o fc — m — 1 _j_ t^k—rn—X 



1 = U'l . 
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Further, both for ctk — and ctk = 1, Lemma [5] gives 

fc-i 

w m +i -w k = l+ a o 2 fc_J ' _1 > 0. 

j=m+2 

For m + 2 < i < k, a), b), cl), and c2) show that tu, > 
e) Equation ( fTOb implies that 

n - 2w rn+1 = 2 k - 2 k - m - n < 0. 



TABLE I 

The weight distribution of S 2 k- 3 . 2 h—m-2 % 



Let 

n(k, m) 



_ 2 fc _ 2 fc - 
= 2 fc -3-2 



2 — ?7 1 ^ 2 2^ 2^ — m— i 2 

k — m — 2 m_ ^ (cym-\-2 



! (2 m+2 -3), 



k — rn — 2 

(14) 



the midpoint of the interval [2 k - 2 fc ~"\ 2 k - 2 fc -"'- 1 ] 
Lemma 7: Let d be the minimum distance of S nt k- 

a) If 2 k - 2 fc " m < n < n(k, m), then 

d = 2 fe ~ 1 - 2 k - m -\ 

b) If n(k, m) < n < 2 k - 2 fc " m - 1 , then 

n-d = 2 k ~ 1 -2 k - m - 2 . 
Proof: a) We have 

Hence a m +2 = 0. By Lemma |5^), 



„ . rtk — l oft— m— 1 

Wm+2 =2 - 2 



b) We have a m+2 
n - w m+2 



1 From ([8]) and Lemma |5t>), 

+ «m+2 ' 



nk — 1 o&— m — 1 I ~ rtk — m — 2 

2-2 + a m + 2 2 



2^—1 2^ ,— m— 2 



Lemma 8: a) If ii> m +2 = Wfc, then 

A d = 2 fc_1 - 2 m . 
b) If w m +2 < Wk and z > m + 2 is given by 

W m +2 = Wi < Wi+1, 

then 

A d = 2 1 - 1 - 2 m . 

In particular, > 2 m in all cases. 

Proof: The conditions imply that Wj has value d exactly 
for m + 2 < j < i (where i = k for case a)). Hence 



A d = J2 v ~ 2 = 21-1 - T 

j=m+2 



w 


-<4 kj 




2^ — 1 _ 2 ^ — m— l 

2^ — 1 ^ _ 2^~ m — 3 

2^ — 1 2^~ m ~2 

2 *-l 


1 

2 m 

2^ _ 2 m +2 

2T71+1 

2 m - 1 



V. Probability of undetected error of S n ,k AND 

Cn ,n — k 

From (01, @, and Lemma[4] we get the following theorems. 

Theorem 2: Let 2 k - 2 k ' m < n < 2 k - 2 fe - m ~ 1 . Then 

Puc(5„, fc ,p) = (2 m - -p) B -™ 1 

+ 2 m JJ i Dm+I ( 1 _ JJ jn-« m+ i 
ft 

+ ^ 2 4 - 2 {^'(l-p)™-^ +p"- lu '(l-p)' iu '}. 

Theorem 3: Let 2 fe - 2 fe -'" < n < 2 k - 2 k ~ m ' 1 . Then 

2 k P uc (C 1hn - k ,p) = (2™ - 1)(1 - 2p) TO1 
+ 2 m (l-2p)™™+ 1 

+ Yj 2 i - 2 {(l-2p)"' i + (l-2p)™-' iu '}. 

Example 2: Consider n = n(fc, rri) = 2 k — 3-2 k ~ rn ~ 2 where 
k > m + 3. Then 

cti = 0, for 1 < i < m, 

Q!to+1 = 1; 
«m+2 = 0, 

a L = 1 for ?7i + 3 < i < k. 
Using Lemmas [3] and [5] we get 



Wi 


= 2 k ~ 


-l 


for 1 < i < m, 


Wm+1 


= 2 k ' 


-l 


2^ — m— 2 


Wm+2 


= 2 k ' 


-l 


2 fe— m— 1 


n - w m+2 


= 2 k ' 


-l 


2^'— m — 2 


n — Wi 


= Wi 




2^—1 ^ 2 k ~ m 



for m + 3 < i < k. 

Hence, the weight distribution of S n .k is given by Table U o 

Example 3: For k = 9 and 777 = 1 in Example [2] we get 
77 = 320 and 

PueOW,?) -2p 128 (l -p) 192 + 504p 160 (l -p) 160 
+ 4p 192 (l-p) 128 +p 256 (l-p) 64 . 

In Fig. Q] we give the graphs of P ue (6*320,9 :P) an d the terms 
2p 128 (l - p) 192 and 504p 160 (l - p) 160 '. The contributions 
from the last two terms, 4p 192 (l — p) 128 and p 256 (l — p) 64 
are so small that they are not visible on the graph. The 
graph illustrates that 6320,9 is ugly. For small p (p up to 
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V 

Fig. 1. Plot of P uc (5 3 20,9,p) (solid line), 2p 128 (l - p) 192 (dashed line), 
504 p 160 (1 -p) 160 (dotted line), and 2 9 ~ 320 (long dashed). 



approximately 0.42), 2p 128 (l — p) 192 is the dominating term; 
in this region the difference 

P uc (S 3 20,9,p)-2p 12S (l-p) 192 

is so small that it is not visible on the graph. For p close to 
0.5, the term 504p 160 (l - p) 160 dominates. o 

Theorem [2] can be used to determine if the code S n> k is 
proper and Theorem [3] if the code C n . n -k is proper. We just 
compute and check the presence or absence of real roots 
in (0, i). For moderate values of n and k (e.g. k < 20), this 
is feasible in a reasonable time. 

Before we give a main general result, we quote two lemmas 
from 0]. 

Lemma 9: a) HI Theorem 2.2: if w > n/2, then 



p w (l-p) n - w 

is increasing on [0, 1/2]. 

b) fl] Lemma 3.5: if (n - y / ri)/2 <w< n/2, then 

p w (l-p) n - w +p n - w (l-p) w 

is increasing on [0, 1/2]. 
Let 



Tfc,m = mM 2 



k-m-2 1 + VT + 2 fc+2 - 2 fc ~ m + 2 ) 
2 J' 



Z k ,m = min <! 2 



iin|: 



k-m-2 



1. 



-1 + VTT2^+ 2 ~ W Im + T 



} 



Theorem 4: For fc > m > 1, if 

2 k - 2 k ~ rn < n < 2 k - 2 k ~ rn 



Tk 



or 



tk— m— 1 



r fe , m < n < 2 k - 2 k 



(15) 
"\ (16) 



then S n ft is proper. 

Proo/: For n = 2 fc - 2 fc - m and n = 2 k - 2 fc -™- 1 , S n , k 
is proper by Example Q] For 2 fc - 2 fc - m < n< 2 k - 2 k - m ~ 1 , 
(O, Lemma |6}l), and Lemma |6^) imply that 



wi = ■ ■ ■ = w m > w m +i > n/2 

is increasing on [0, 1/2] for 1 < i < 

rj, where < 77 < 2 k - m ~ 2 . 



and so p w *(l - p) n ~ u 
m + 1 by Lemma |9^). 

Now, consider n = 2 k — 2 k ~ 
By Lemma |7^) 



d = 2 k ^ - 2 k ~ m - 1 < -. 
By Lemmas |6}l) and|9j3), 

p Wi (1 - p) n - Wi + p n ~ w * (1 - p) w ' 
is increasing for m + 2<i<kif2d>n — s/n, that is, if 
2 k - 2 k ~ m = 2d> 2 k - 2 k -' n + n- yj2 k - 2 k ~ m + rj. 
This is equivalent to 



and 



\j2 k - 2 k ~ m + n > n 



2 k - 2 k - m + r) > n 2 . 



Solving this for 77, we get 



V 



< 



1 + y/1 + 4(2 fc - 2 k ~ m ) 



We see that if ( fTBI l is satisfied, then all the terms in 
Puc(S n ,k,p) are increasing on [0,1/2]. Consequently, 5„.fc is 
proper. 

Next, let n = 2 k -2 k -' m - 1 -n where < 77 < 2 fe - m - 2 -l. 
Then, by Lemma [7J5), 



d = 2 fe - x 

We want 

2 k - 2 k - ,n - 1 -2n = 2d 



■)fc— m— 2 



>2 k - 2 k - m - 1 -n- ^/2 k - 2 k ~ m - 1 - n. 
Solving for rj, we get 7/ 2 + n < 2 2 — 2 k ~ m ~ 1 and so 



n 



< 



-1 + y/l + 4(2 fc - 2 k - m - 1 ) 



As above, if dT6b is satisfied, then S n .k is proper. ■ 
When S n .k is proper, then it is satisfactory, and so, by 
Lemma Q] C n , n -k is satisfactory. Hence we get the following 
corollary. 

Corollary 1: If n is in the range defined by ( fTBI ) or ( TToT l for 
some m > 1, then C n , n -k is satisfactory. 
Theorem 5: a) If 



m > 

then S 1 ,!^ is proper for all n £ 
b) S n ,k is proper for all n £ 



(17) 



2 k _ 2 k 



k—r 



-yk — ni— 1 



,2 k 
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Proof: We have Tk, m = 2 fe m 2 if and only if 

1 + s/i + 2 k + 2 - 2 fe -™+ 2 



^k — r 



> 2 



k—m—2 



(18) 



< A d - 



1 

n 



(20) 



We observe that 2 



k — m — 2 



decreases with increasing m and 
vl-|- 2 fe + 2 — 2 fe ~" l + 2 increases with increasing m. 

Let a; = 2 k ~ m ~ 1 . Then (TT~8T > is equivalent to the following 
sequence of inequalities 



where 



h(x) = -x log 2 x - (1 - x) log 2 (l - x) 



1 + \J\ + 2 fe + 2 - 8x > x, 



2x+l, 



For k 



1 + 2 k+2 -&x> {x-l) 2 = z 

x 2 + 6x < 2 k+2 . 

2 m + 3 we get x = 2 m+2 and so 

2 + 6x = 2 2m+4 + 6-2 m+2 <2 2m+5 = 2 fc+2 



(19) 



for all m > 1. However, for k = 2 m + 4 we get ir = 
and so 

x 2 + 6x = 2 2rn+6 + 6 • 2 rn+3 > 2 2m + 6 = 2 fc+2 



Hence, ( fT8l is satisfied if and only if fc < 2 m + 3, that is 
when ( [T7| i is satisfied. Therefore, if ( fT7] i is satisfied, then S n k 
is proper for all n e [2 k - 2 fc -"\ 2 k - 2 k - m + 2 fe -™- 2 ]. 

Next, since -2 k - m+1 > -2 k - m + 2 , we see that if (T8) is 
satisfied, then 



-1 + Vl + 2 k + 2 - 2 k - m+1 1 
> — 



VI + 2 k + 2 - 2 k - m + 2 



1 



Hence, r fc m = 2' £ -" i - 2 - 1, and so S. 



n G f2 fc - 2 



ni fe is proper also for all 



This, combined with the result above, proves a). 
Since S n ,k is proper for 



is the binary entropy function. 

We can reformulate this to the following well-known suf- 
ficient condition for a code to be ugly (see e.g. JT] Theorem 
2.11] or 0): 

Ad>2 k-n+nh(d/n)_ ^ 

We showed in Example Q] that S , 2 &_ 2 fc - m - 1 ,fc i s proper for 
all m < k. In general, S n ,k and C n .n-k may be ugly for some 
values of n when 2 k - 2 k - m < n < 2 k - 2 k ~ m - 1 . 

We have checked that 5„.fc is proper for all n > 2 fe ~ 1 when 
k < 8. When fc > 9, S n .k is ugly for some values of n. An 
example is 6*320,9 in Fig. Q] 

Lemma 10: For a given k, let 

</(n) = fc — n + nft,^— ^j. 

Then g(n) is increasing with n on [2 k — 2 k ~ m , n(k, m)] 
and decreasing with increasing n on [n(k, m), 2 k — 2 k ~ rn ~ 1 ], 
where n(k, m) was given in (TT~4-b . 

Proof: From the definitions of /i(ar) and g(n), we get 

<7(n) = k — n + n log 2 n — d log 2 d — (n — d) log 2 (n — d). 

By Lemma |7h), d is constant for n G [2 fc — 2 fc ~ m , n(fe, m)]. 
Considering n as a real variable for the moment, direct 
calculations gives 

dg(n) 
dn 

Since d<n/2, we get < 



= -l + log 2 (- 



<= [2 fe - 2 fc_m ,2 



Similarly, for n £ [n(k, m),2 k — 2 k m 1 ], n — d is constant 
by Lemma I7J5), and so 

dg(n) 



for all m > 



fc-3 



, b) follows. ■ 
Based on the previous theorems, we have found a set of 
values of n for which S n ,k is proper and, hence, satisfactory. 
For other values of n, the existence of real roots of in 
(0, i) must be checked. However, for large values of k and 
n in general it may be difficult to numerically compute the 
polynomial's real roots, or even just to determine the existence 
of real roots (e.g. using Sturm's chain). However, in many 
cases we can decide that the code S n> k (and hence C n . n -k) 
is not satisfactory (i.e. ugly) by showing that P ue (C,p) > 
2 k ~ n for some value of p. How should the value of p be 
chosen? There is no theory that can give an exact answer 
to this question. However, it is known that if the minimum 
distance of the code is d, then AdP d (l — p) n ~ d is often the 
dominating term of P uc (C,p), except for large p. This is well 
illustrated by the example of 8320,9 given in Fig. Q] Since 
p d (l — p) n ~ d has its maximum for p = d/n, a good choice 
for p may be p = d/n. This gives the following sufficient 
condition for S n .k to be ugly: 



dn 



= -1 



log 2 Q)>0. 



Note: The weight distribution of S n ^k. m ),k was given in 
Table U In particular, Ad = 2 m for n — n(k,m). Moreover, 
A d > 2 m for all n G [2 k - 2 k ~ m ,2 k - 2 fc -'™- 1 ]. Hence, (ED 
is satisfied for some such n if and only if it is satisfied for 

n = n{k, m). 

For n = n(k,m) = 2 k ~ m ~ 2 (2 rn+2 — 3) we have, from 
Table U that 



and so 



d = 2 fc -™- 1 (2 m - 1) 



2 m+l _ 2 



n 2™+ 2 -3' 



For a fixed m, let 



G(k) =k-m-2 



k — m — 2 



where we consider k a real variable, and where 

( / om+l _ 0\ 1 



(22) 



(23) 
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Then ( |2TT i. for n = n(k, to), can be rewritten as 

G(k) < 0. (24) 

We get 

G'(fc) = 1 - 2 h - m - 2 U m In 2, (25) 
G"(k) = ~2 k ~ m - 2 U m (\n2) 2 . (26) 

To analyze G(k) further, we first give some relations for 
h(x). 

Lemma 11: For < x < 1/2, we have 

(1 - 2a;) 2 



TABLE II 

The values of K(m) for 1 < m < 356 



h(x) < 1 



2 In 2 



(27) 



and 



Proof: Using Taylor's theorem, we get 



1 ~ (l-2x) g 



In 2 ^ 2i(2i- 1) 

for < x < 1. 

The upper bound ( f2Tb follows immediately since 



(29) 



/i(x) = 1 - 



(1 - 2x) 2 l ~ (1 - 2.x) 



In 9 ^ 



2; 



21n2 In 2 ^ 2i(2i - 1) ' 
Next, from (l29l , we get 

oo 

= ft(0) = 1 V — ; . 

V 1 In 2 ^ 2i(2i- 1) 

I— 1 ' 

Since (1 - 2x) 21 < (1 - 2x) 4 for i > 2 and x e (0, 1/2), we 
get 

fe (x)>l- (1 - 2a;)2 -J-V (1 " 2X)4 
21n2 ln2-^ 2i(2t-l) 



(1 - 2x) 2 (1 - 2x) 



21n2 In 2 ^ 2i(2i - 1) 



4 oo 

2z(2i- 1) 



1 



(l-2x) 2 (l-2x) 4 



21n2 



In 2 



Lemma 12: For to > 1, we have 

2 m+3 J7 m ln2 = l + u m , 

where 

3 3 (21n2-l)2 m + 2 

< u m < 



2m+2 _ 3 m 2" l + 2 — 3 (2 m + 2 — 3) 3 
Proof: We have 

2 m+l _ 2 1 

1-2 



2m+2 _ 3 2 m + 2 - 3 ' 

Hence, from d23l l and d2Tb we get 

2 rn+3 U m In 2 > 2" l+3 In 2 (2 m+2 - 3) 



2 In 2 (2 m + 2 - 3) 2 



in £ 


{1} [2,4J [5,14] [15.36J 


K(m) 


9 2 m + 8 2m + 9 2m + 10 


m 6 


[37.81J [82,172] [173,356] 


K(m) 


2m +11 2m + 12 2m + 13 



This proves the lower bound on u m . Similarly, d23T l and (l28t 
imply the upper bound on u m . ■ 
In particular, ( f24b and Lemma [121 imply that G(m + 1) > 
for all to > 1. Since G"(k) < for all k and G(k) -> -oo 
when fc — > oo, we see that G(fc) = has a unique root in 
[m + l,oo), we denote it by k(itl). Further, G(k) > for 
to + 1 < k < K,(m). Also, G(k) < and G(k) is decreasing 
for k > k. 

Lemma 13: For to > 1, k(to) is not an integer. 

Proof: For n = n(k, m) = 2 fc - m - 2 (2 Tn+2 - 3), we have 

d = 2 fe - m - 1 (2 m -l), 

n-rf = 2 fe -" l - 2 (2 m+1 -l). 

and ^4^ = 2 m . By definition, fc = k(to) if 

(s) 

or equivalently, 

2 k-n n n = 2 m d d ( n _ d) n-d_ (3Q) 

Hence, if k = n{m) were an integer, then the exact powers of 
2 dividing the two sides of (f30b would be the same. We will 
show that this is not the case. 
The exact power of 2 dividing 2* 



, rl \ d / A \ n — d 

2 k - n = 2 m (-) (1- 



fc — n + n(fc — to — 2). 

The exact power of 2 dividing 2 m d d (n — d) n ~ d is 

to + — to — 1) + (n — d){k — to — 2) 
= to + d + n(fe — to — 2) 
> k — n+ n(fc — to — 2), 

that is, we have a contradiction. Hence, k(to) is not an integer. 

■ 

Let K(m) be the smallest integer k such that G(k) < 0. 
Then K(m) > k(to) and so we have the following: 



K(m) = [~k(to)1 . 



(31) 



2 m + 2 - 3' 



In Table HIl we give the values of K(m) for m < 356. 

We will next determine good bounds on k(to). These can 
in turn be used to determine K(m). We use the notations 

A = log 2 (ln2) -0.5287663728, 
A = 5 + A, 
fx = /u(m) = to + A, 
p(m) = to + fx + log 2 = 2to + 5 + log 2 (^ In 2) 
= 2to + 5 + log 2 fi + A. 

Lemma 14: Let c > 0. 
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TABLE III 

Values of n for which (f2Tb is satisfied, and therefore, both S„ ik and C n , n _ fc are ugly 





TTl — 1 


m = 2 


TTl — 3 


m = 4 


9 


[315, 324J 








10 


[599, 676] 








11 


[1140,1396] 








12 


[2219, 2878] 


[3286, 3367] 






13 


[4331,5853] 


[6458, 6844] 






14 


[8540, 11878] 


[12717, 13888] 


[14812, 14883] 




15 


[16870, 23966] 


[25208, 28006] 


[29371,30013] 




16 


[33486,48290] 


[50034, 56408] 


[58305, 58368] [58370, 60396] [60416, 60461] 


[62460, 62468] 


17 


[66546, 66560] [66593, 97028] 


[99602, 113304] 


[116102,121434] 


[124378, 125472] 


18 


[132560, 194804] 


[198432, 227423] 


[231354,231424] [231451,243631] [243712,243730] 


[247954, 251741] 



a) If 



then 



b) If 



then 



(2 C - IV + 2 C M u m > c + log 2 n, (32) 

k(to) < p(m) + c. 
(2 C - 1)m + 2 c /i u m < c + log 2 /Lt, (33) 

k(to) > p(m) + c. 

Proof: Let fc = p(to) + c = to + yU + log 2 fx + c. Since 
/i = m + 5 + log 2 (In 2), we have 

G(fc) = k-m- 2 k ~ m - 2 U m 

= fj, + log 2 jU + c— 2 ro+3+c /x In 2 U m . 

By Lemma [121 

G(fc) = fx + log 2 /i + c - 2 c yu (1 + u m ). 

If d32b is satisfied, then G(k) < and so k > /t(m). This 
proves a) and the proof of b) is similar. ■ 
Let 



iuln2' 

log 2 M _ (log 2 /^) 2 
/iln2 2^ 2 ln2 ' 

Corollary 2: If m > 2, then 

w(m) < k(to) < ZU(m). 



w(m) = p{m) + 
w(m) = p(m) + 



Then 



Proof: First we consider the upper bound. Let c = 



(2 c -l) M =( e cln2 -l)/i = / i^ 



(log 2 ttf 
ilfi 1 



> log 2 H 



(log 2 

2;U 



log 2 [i 

> log 2 ^ H — = c + log 2 |U 

/iln2 

for log 2 (j, > w 2.885, that is, /x > 7.389, i.e. m > 3. For 
m = 2 we get (2 C - - (c + log 2 fi) w 0.0468 > also. 
In particular, d32l is satisfied for all m > 2. The upper bound 
therefore follows from Lemma [l4h). 



The proof of the lower bound is similar. For m = 2, 3, 4 we 
can show it by direct computation. Some calculus shows that 
(2 C — < log 2 /j, for all to > 2 and fj,u m < c for to > 5. 
We skip the details. The lower bound therefore follows from 
Lemma US'). ■ 

From Corollary |2] we immediately get the following result. 

Corollary 3: We have re(m) — p(m) —> when m — > oo. 

Theorem 6: For all to > 2, we have 

if(m) = [w(to)] (34) 



if(m) = [w(to)] + 1. 

In particular, if there is no integer between uAm) and w(to), 
then 

K(m) = |~o;(to)] . 

Proof: Since SJ(m) < w(m) + 1, (|3T1 i and Corollary |2] 
imply that 

[w(to)] < X(to) = f«(m)l < T^ 7 ")! < M m )l + 1- 

Further, if there is no integer between w(to) and uJ(m), then 
[cj(to)] = [Zj(to)]. ■ 
The difference 



w(m) — tj(m) 



(log 2 /^) 2 
2^ 2 In 2 



is small, except for small to. Hence, to have an integer between 
Lj(m) and u7(m), cJ(m) must be close to and above an integer. 
If we denote this integer by 2to + 5 + u, then to must be close 
u — 5. We will make this statement more precise in 



to 



In 2 



the following lemma. 

Lemma 15: Let u be a positive integer. 

a) If 

2 U „ u 2 
TO < u — 5 , 

- In 2 2"' 

then ZTJ(to) < 2m + 5 + u. 

b) If 

2 U u 2 
to > u — 5 H . 

- In 2 2"' 

then w(m) > 2m + 5 + u. 

Proof: In this proof, we let to be a positive real variable. 
We note that ZU(m) and w(to) are still well defined. Moreover, 
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simple calculus shows that dni J-' m '> > and d —^> > for all 

r dm dm 

m > 0. Therefore, a) is equivalent to 

_/ 2 U u 2 \ 

lu u — 5 < 2m + 5 + u 

Vln2 2 U J 

and similarly for b). 

Proof of a). Let m = r~n — u — 5 — Then a = — y, 

y In 2 2 U ' In 2 y ' 

where 



and so 



y = u - A + — 



log 2 n = log 2 (^— ) + log 2 ( 1 - 



yln2 

2 U 
y In 2 



Hence 



= it - A H In 1 

In 2 V 2 U 

y 

< u — A — < u — X. 

2 U 



W(m) = 2m + 5 + A + log 2 fi + -^f 2 -^ 

/i In 2 

2/ u - X 



<2m + 5 + u- 
< 2m + 5 + u 



2 U 2"-yln2 



if 



V > u- X 



(35) 



2 U ~ 2 U -y\n2 
The inequality ( 1351 ) is equivalent to 

2 

( u -A+| J )(2«-2/ln2) >2>-A) 
which in turn is equivalent to 

U " A+ f) ln2 - 

This is satisfied for all u > 6. For 1 < u < 5, we can show a) 
directly by numerical computation. This completes the proof 
of a). 

The proof of b) is similar. We give a sketch, leaving out 
some details. Let m = y-* — u — 5 + Then /i = ^ — y, 
where 



and so 



y = u — A — — 



1 , / yln2 

log 9 u = m — A H -ml 

62 P In 2 V 2" 



> u - X - 



!J 



We have 

w(m) = 2m + 5 + A + log 2 /i + 



2" - yln2 

log 2 n (log 2 /i) 2 



/i ln 2 2/z 2 ln 2 
We observe that the function 

log 2 x (log 2 xf 



log 2 a; + 



is increasing with x. Hence 

uj(m) > 2m + 5 + u — 

U — X — 



y 



2 " - y ln 2 

* — (« - A - — 



2/ In 2 



•y In 2 > 



; ln2 



2 U - yln2 
> 2m + 5 + u 



2(2"-yln2) 2 



X 



2 U - y ln 2 



> 



- jj In 2 



) 2 ln2 



2(2«-j/ln2) 



that is, 



2" 



> 



2 y+("- A -rfl^) 2 ln2 



2{2 U - yln2) 

This is satisfied for u > 2. Direct computation shows that b) 
is true also for u = 1. ■ 
Combining Lemmas IT5b ) and b), we see that if there is an 
integer between u>_(m) and uJ(m), then this integer is 2m+5+M 
for some integer u, and m = m u where 



2 U u 2 2 U u 

u — 5 < m„ < u — 5 H . 

In 2 2 M In 2 2" 



(36) 



We have checked I® for u < 10000. For 8 < u < 10000, 
there is no integer satisfying d36b . For u < 7, there actually is 
an integer m„ satisfying (|36] l. However, in these cases, direct 
computations show that K(m u ) = \ta{m u )~\ . We can therefore 
conclude that K(m) = \u{m)~\ for 2 < m < 2 ■ 10 3010 . This 
is, of course, far beyond what is needed for any practical 
application. Whether there are any u > 10000 such that 
there is an integer satisfying (f36l > remains an ooen question. 
However, the length of the interval in d36b is and 



oo 

E 

u=10001 



2W 2 

2 U 



1.003-10 



-3002 



:ln2 2a; 2 In 2 



Therefore, it is highly unlikely that there is an integer satis- 
fying ( f36l > for some u > 10000. Based on this, we conjecture 
that K(m) = |~w(m)] for all m > 2. 

If k > K(m), we define bi(k,m) to be the smallest integer 
and 6 2 (fc, m) the largest integer such that: 

• bi(k,m) < n(k,m) < 6 2 (fc,m), 

• ( |2~TT i is satisfied for bi(k,m) <n< 6 2 (fc,m). 

In the next section, we give estimates for b\{k,m) and 
b 2 (k,m). 

For k < 18, we have computed the values of n in the range 
[2 k - 1 + 1, 2 k - 1] for which (ED is satisfied. For k < 8, this 
never happens; and we have checked that C n , n -k is always 
proper for k < 8. For 9 < k < 18, the values of n for which 
(l2~TT i is satisfied are given in Table Hill Since K(5) = 19, we 
only have to consider m < 4 when k < 18. 

We see that, in general, for any given k and m the set of n 
where (|2TT > is satisfied consists of zero or more intervals. 

Example 4: Typically, we have several intervals, except for 
small values of k. We describe k — 17, m = 1 as an example 
to illustrate why this is the case. We first give a small list of 
values in Table II VI We see that d2~Tl i is satisfied for n = 66593, 
but not for n = 66592. Since (fJTJ turns out to be satisfied for 
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TABLE IV 

Some values of A d and 2 k ~ n+n Hd/n) for k ■ 
66545 < n < 66593 



17 AND 



n 


A d 


2 & — n-\-n h{d/n) 


66545 


62 


62.4 


66546 


62 


61.5 


66560 


62 


49.7 


66561 


30 


48.9 


66592 


30 


30.3 


66593 


30 


29.8 



TABLE V 

Ranges of n where S„.k is proper when 

2 fe-l + 1 < n < 2 k-l + 2 k-2^ 



10 

11 

12 
13" 

11 
15 
16 
17 
18 



257,307 



513, 
TT025; 



587 



331, 384 



688, 



1124] 



11424. 



[2049, 2195]" 
[4097,4298f 



768] 
1536] 



[8193, 8489T~ 
[16385, 16798]" 



[2904, 3072]~ 
[5908, 6144]~ 



[32769 
[65537 



33376] 



[11937, 
[24081, 



12288 



245761 



[48422, 
[97245, 



49152] 
98304] 



[131073, 132321] [195096, 196608] 



66593 < n < 97028, we get h (17, 1) = 66593. For n in the 
range [66561,66593] we have A d = 30. Since 2 k ~ n+n 
is decreasing with increasing n, ( ED is not satisfied for n in the 
range [66561,66592]. However, we see that for 77 = 66560, 
we have a jump in the value of Ad compared to n = 66561, 
and ED 

is again satisfied for all n in the range 66546 < n < 
66560. For 77 = 66545, ED is again not satisfied. o 

Example 5: An interesting example occurs when k = 16 
and 777 = 3; 77 = 58369 = 61 (16, 3) — 1 is an isolated value 
of n for which ED is n °t satisfied. We have 



n = 2 16 - 2 13 + 2 10 + 1 and d 
We have Ad = 8 and 

A d {d/n) d {l - d/n) n - d ^ 

ok—n 



,15 



0.989 



,12 



and so ED is not satisfied. However, Ad+i — 16 and 
A d+ i (d/n) d + 1 {l-d/n) n - d - x 



2 k- 



1.9106, 



so the contribution from this term alone is sufficient to 
conclude that 558369,16 is not satisfactory after all. This shows 
that if (ED is not satisfied, but the second lowest weight of 
S n .k is close the minimum weight d, it may be a good idea 
to consider the contribution from this weight also. o 

For k < 12 we have checked if the codes S n ,k and C n . n -k 
are proper when (1211 is not satisfied. It turns out that this is 
always the case for C n . n -k, but not for S n ,k- As an illustration, 
in Table [V] we give the range of values 77 for m = 1 and 
9 < k < 18, such that S n .k is proper. 



VI. Approximate Analysis 

In Table Hn] we see that for a given m, an increasing fraction 
of the codes are ugly when k increases. Define pi(k, m) and 
(3 2 {k,m) by 

h(k, to) = 2 k - 2 k - ,n + to) 

and 

b 2 (k,m) = 2 k -2 k - m +f3 2 (k,m). 
Clearly, < (3i(k,m) < /3 2 (k,m) < 2 fc - m " 1 . Let 
71 (fc, m) = (k — m) hi 2 



yk—r 



)ln2. 



+ J{k - TO) 2 (ln2) 2 + 2(k - w)(2 fc 

Theorem 7: We have 

Pi(k,m) < l7x(k,m)]. 

Proof: Let n = 2 fe - 2 fe -™ + a, where < a < 2 fe -™- 2 . 
By Lemma H d = 2 k ^ - 2 fc -™-\ and by Lemma [8] A d > 
2 m . By (ED we see that if 

to > k — n + nh(d/n), (37) 

then A d > 2 m > 2 fe -"+ n,l ( d /™), and so S n , k is ugly. We have 

n — 2d a 
n n 



l-2 d - 

n 



By ED, 



Hence 



1 



h - < 1- 

,77/ 21n2 n 2 



n + n /i ( — ) < k 

\n<> 



2n In 2 



2(2 fc -2 fe -" l + cr)ln2' 



By (O, if 



> k 



(38) 



2 ( 2 fc _ 2 fc-m + cr) In 2' 

then S n .k is ugly. Since ( 1381 is equivalent to er > 71 (fc, m), 
and er is an integer, that is a > [71 (fc, m)~\, we can conclude 
that (3i(k,m) < [71 (fc, to)]. This proves the theorem. ■ 

Remark. We see that /?i(fc, to) — 1 is an upper bound on the 
number of n in [2 fe -2 fc -" l + l, 2 fe -2 fe - m + 2 fe -" 1 - 2 ] such that 
SVi,fc is satisfactory. Therefore, a main corollary of Theorem 
[7] is that, for any fixed to, /3i(fc, m) /2 k ~ m ~ 2 converges to 
exponentially fast when k increases. 

For 2 k - 2 k - m + 2 fc -™- 2 < n < 2 k - 2 fc - m -\ we have a 
similar result. Let 

j 2 (k, to) = — (fc — to) In 2 

+ ^{k- TO) 2 (ln 2) 2 + 2{k - m){2 k - 2 k -™- 1 ) In 2. 

Theorem 8: We have 

/3 2 (fc,TO) > 2 k - m - 1 - [ 72 (fc,TO)l. 

Proof: The proof is similar to the proof of Theorem [7] 



Let 
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where < ( < 2 k m 2 . From Lemma [7] we get 

d = 2 k ~ 1 - 2 k -' m - 2 - C, 

and so 



1-2 



n - 2d _ C 
n 



n n 

Therefore, analogously to ( f38l > we get 

c 2 



> k- 



2(2 k - 2 fe -™- 1 - C)ln2 

is a sufficient condition for S n ,k to be ugly. Since this is 
equivalent to £ > [~7 2 (fc, to)], Theorem [8] follows. ■ 

Theorem 9: Let N k be the number of n G [2 k ~ 1 ,2 k - 1] 
such that SVi,fc is satisfactory. Then 

2 (fe+5)/2 



N k < k + 



Vfc 3 In 2. 



Proof: The number N k , rn of satisfactory codes for n in 
the interval [2 k - 2 fe ~ TO + 1,2* - 2 k ~ m - 1 } is at most those 
for n G [2 k - 2 fe ~ TO + l,6i(fc,m) - 1] plus those for n G 
[6 2 (fc,m) + 1,2* - 2*- m_1 ]. The number of n in the first 
interval is 

6i(Jfc,m) - 1 - (2 k - 2 k ~ m ) = ft(fc,m) - 1 

< [7i(fc,m)l - 1 < 7i(fc,m). 
The number of n in the second interval is 

2 fc _ 2 fc-m-l _ m ) = 2 k-m-l _ m ) 

< [7 2 (fc,m)l < 7 2 (fc,m) + 1. 
We see that, for 1 < m < k — 1, we have 
0-m)ln2 +2(2 fc -2 fe - m ) 

< (k- m)ln2 + 2(2 fe - 2 k -™- 1 ) < 2 k+1 . 

Hence 

N k>m < 7i(fc,m) + 72 (fc,m) + l < 1 + 2^J (k - m)2 k + 1 In 2, 
and so 

fc-i 



^ = 1 + 2 ^ 



m— 1 

fe-1 



< 1 + (1 + 2J(k - m)2 k + 1 In 2 



m— 1 

fe-1 



/c + 2 ^ ^/(fc-m)2 fe + 1 ln2 

m— 1 

fe-1 

fc + 2( fc+3 >/ 2 yirT2^ Vfc^ 



<A; + 2( fc + 3 )/ 2 v / irr2--7fc 3 ". 

3 



Corollary 4: 



9fc-l 



0, 



when k —> oo. 

Theorem [9] shows that C„ „_fc is ugly for most values of n. 
On the other hand, C„ is satisfactory for many values of 
n as shown by Corollary [T] 



VII. A GENERALIZATION OF THE CONSTRUCTION 

The matrix H k was defined by concatenating H^~ m ^ for 
m = 1, 2, . . . , k. We can generalize this by concatenating t\ 
copies of H^~ x \ followed by £ 2 copies of H^~ 2 \ t 3 copies 
of H^ k 3 \ etc. for any sequence (ti,t2, • • • ,tk) of positive 
integers. Most of the previous results carries over, with obvious 
modifications. For now, we only consider the construction with 
ij = 1 for i > 2, and we write t\ = t. As before, we use the 
notation S rlyk for the codes generated by the first n columns 
of the matrix. For large t, these codes have low rate. The dual 
codes will have very high rate and minimum distance 2. 

Consider S n '_ k generated by H k (n'), and let S 7ltk , where 

n = 2 k -\t - 1) + n' G [2 k ~H, 2 fe ~ 1 (< + 1) - 1], (39) 

be the code generated by the matrix 

t-i 



H k (n) = H, 



(k-i) lH (k-i) 



■ ■ ■ \H; 



(fe-1) 



H k {n'). 



We see that we get a code S n . k for each n > 2 k 1 . Also, 
given n, the values of t and n' are uniquely determined by 



From its definition, we immediately get the following 
lemma. 

Lemma 16: a) The weight of the first row of H k (n) is 
2 k ~ 1 (t — 1) larger than the weight of the first row of H k (n'). 

b) For any other non-zero codeword in S n ,k, the weight 
is 2 k ~ 2 (t — 1) larger than the weight of the corresponding 
codeword in H k {n'). 
In particular, we see that 

• The minimum distance of S n . k is 2 k ~ 2 (t— 1) larger than 
the minimum distance of H k {n'). 

• For a non-zero codeword of S n . k of weight w, either 
w > n/2 or there is a unique other codeword in the code 
of weight n — w. 

This last property was used to prove Theorem [4] Therefore, 
this theorem can be directly generalized by a similar proof. 
Let 



^"i,fe,m 



— t,fe,' 



mini 2 



,k — m — 2 



1 + yjl +2 k+1 (t + 1) - 2 k ~ m + 2 



min{2 fc - m - 2 - 1, 



-1 + ^1 + 2 fc +! (t + 1) - 2 fc -™+! 
2 

Note that ri,fe, TO = r fcjm and z l k m = r fe m . 
Theorem 10: For t > 1 and fc > m > 1, if 



2 k - 1 (t + l)-2 k - m <n< 2 k - L (t+l)-2 k - m +T t . k . m (40) 



ife- 



tk-li 



k-r 



or 



2 A; " 1 (t + l)-2 



fe — m — 1 



< n < 2 fc " 1 (i + l) — 2 



fe — m — 1 



(41) 

then S n> k is proper. 

The proof is similar to the proof of Theorem [4] and is 
omitted. 
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Theorem 11: a) If to > 1 and 

-fc-3-log 2 i 

TO > 



(42) 



then 5 n ,fe is proper for all 

n € [2 fc_1 (* + 1) - 2 fe - m , 2 fc ~ 1 (i + 1) - 2 fc -™- 1 ]. 
b) 5 n ,fc is proper for all 

i i 7 fc — 3 — logo t a 

h£ 2 fc ~ 1 (i + 1) - 2 k ~\ ^ l,2 fc_1 (i + 1) - 1 . 

Proof: Similarly to Theorem [3] we see that if 

x 2 + 6x < 2 k+1 {t + 1), (43) 
where x = 2 k ~ m ~ 1 , then S n ,k is proper for all 

n e [2 fc_1 (t + 1) - 2 fe -"\ 2 fe " 1 (i + 1) - 2 fc - m - x ]. 

We see that if 2 fe - 2m ~ 3 > t + 1, then x 2 > 2 k+1 (t + 1) and 
so d43l is not satisfied. However, if 



(44) 



then .t 2 < 2 k+1 t and so 

x 2 + 6x< 2 fc+1 t + 6-2 fc ~ T "- 1 < 2 k+1 (t + l) 

for 6 ■ 2~ m ~ 2 < 1, that is, all to > 1. Since (|44T i is equivalent 
to — 2 m — 3 < log 2 t, we get the theorem. ■ 

Theorem 12: For fc > 6 there exists an integer 0(k) < 2 fc_5 
such that 5 rlj fe is proper for all n > 2 k ~ l 9(k). 
Proof: For fc > 6 and t > 2 k ~ 5 , we get 

fc — 3 — log 2 1 ^ k — 3 — (fc — 5) ^ 
2 ~ 2 ~ 

By Theorem [Tib), S*„ jfc is proper for all 

n e [2 fc - x (i + 1) - 2 fc -\2 fe - 1 (t + 1) - 1]. 

This implies that S n .k is proper for all 

n > 2 k ~ 1 ■ 2 k ~ 5 = 2 2k ~ 6 . M 

Corollary 5: If fc > 6, then S n ,k is proper for all 

n > 2 2k - 6 - 3 • 2 k - 3 + 2. 

Proof: By Theorem [121 S n ,k is proper for all n > 2 2k ~ 6 . 
Next, TheoremfTTb) for t = 2 fe_5 — 1 shows that S n .k is proper 
for 



n e [2 2k - 6 - 2 k - 2 ,2 2k - 6 - 1]. 

fc-5 



Finally, Theorem HOl for t = 2 
implies that S n ,k is proper for 



n e \2 2k - e - 2 fc - 2 



1, fc > 6, and m = 1 
-LT 2fe - 6 _ 1 , M J J 2 2fe - 6 -2 fe - 2 ]. 



It remains to show that 

Ll4Jk-6_ 1)fc) iJ = 2 k 3 

We have 



2. 



(45) 



Tnfc-5_ 



-1 + \/l + 2 2fe - 4 - 2 k 



l.k.l 



-l + %/(2 



fe-2 



-1+ v /(2fe-2_3) 2 + 2 fe - 1 - 



< r 



-l.fe.i 



< 



-l + (2 



fe-2 



2) 



and so, for fc > 6, 

-1 + (2 fc - 2 - 3) 

2 V23 *— 1 >"'> 1 ~ 2 

This proves (1451 . ■ 

Until recently, the best general result of this kind was JJJ 
Theorem 2.64]: If fc > 5 and 

n > (2 k - l)(2 fe -3), 

then there exists a proper [n, fc] code. 

This bound was recently improved in O to the following: 
If fc > 5 and 

n > 2 fc-l( 2 fe-5 + 2 L(fc-5)/2j^ 

then there exists a proper (and self complementary) [n, fc] code. 

Clearly, Theorem[T2] above gives a further improvement and 
is now the best known such bound. 

We can also find lower bounds on 9(k). We consider S n ,k 
for n in the middle of the interval with to = 1, that is 



n = n(fc,l) + 2 fc ~ 1 0-l) 



lfc-3 



(4t + l), 



where n(k, 1) was defined in ( TBI . When we consider only 
the term in S n ,k of lowest degree, we know that the case 
n = n(k, 1) + 2 fc ~ 1 (i — 1) is the worst case (cfr. Lemma [TOb. 
Moreover, this term of lowest degree is the dominating one in 
S n Therefore, it is reasonable to consider these values of n 
when we look for non-proper S n .k- 
Theorem 13: For fc > 6 we have 

a) 6(k) > 6»i (fc) = min{i | S 2 k- 3{it+1) k is proper}, 

(46) 

2 fe ~ 6 1- 
(fc- l)ln2 ~ 4 



b) 



0{k) > 9 2 {k) 



(47) 



Proof: Proof a). We see that if 5 , 2 fc - 3 (4t+i),fe i s not proper, 
then by the definition of 6»(fc), 0(k) > t. Therefore, (|46b 
follows. 

Proof b). Again, consider S n ,k for n = 2 fe_3 (4t+ 1). Then, 
by Table [TJ and Lemma [16] 



d = 2 k ~ 2 + 2 K -*(t - 1) 
2, (fJJJ implies that the code is ugly if 

±>k — n + nn \ — 

n 



2 k ~ 2 t. 



Since A a 



We have 



d 2t d 
- = and 1-2- 

n it + 1 n 



(48) 



4i+l' 



and so, by (1271 . 

— n + nh^—^j < n 
Hence, if 



( 4t +0 

21n2 



)fc-4 



(4i+l)ln2 



■>fe-4 



1 > fc 



(4f + l)ln2 : 
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that is 



■tk-6 



t < 



1 

4' 



l)ln2 

9{k) > t, and the theorem 



then S n ,k is ugly- Therefore, 
follows. ■ 

We now give a lemma that is useful for studying when S Ut k 
codes in general are proper for a given k. 

Lemma 17: Let n > 2 fc ~ 1 + 1 and let w be the weight 
of the first row of H k (n). If P uc (S n , k ,p) - p w (l - p)»-" 
is increasing on [0,1/2], then S n+2 k-i u k is proper for all 
integers u > 0. 

Proof: The weight of the first row of Hk(n + 2 k ~ 1 u) is 
2 fe_1 w + w. The weight of any other non-zero codeword in 
S n +2 k - 1 u,k is 2 k ~ 2 u larger than the corresponding codeword 
in S n k- Hence 



P\ic{S n 



+2 k 



(1- P ) n - 



P 



2 U (l-pf 



Pne(S„,k,p)-p W (l-p) 
'{l-p) n 



is increasing on 

n—w 



By assumption, P ne {S n ,k,p) - P 

[0,1/2]. Since p 2 "~ 2u {l -p) 2 "^ u and p w (l - p)"-" are in. 
creasing on [0, 1/2], we can conclude that P U c{S n+2 k-i u k>P) 
is increasing on [0, 1/2], that is, S n+2 k-i u k is proper. ■ 

For the use of this lemma, it is useful to observe that 
the conclusion of Theorem [10] can be improved: if d40b or 
(ETT i hold, then P ue (S n ,k,p) —p w (l —p) n ~ w is increasing on 
[0, 1/2]. The proof carries over immediately. 

Using Lemma [T7] and computations, we have determined 
9(k) for 6 < k < 17. These values are given in Table [VJ 
together with the lower and upper bounds on 6(k) in Theorems 
[T3land[T2l We have also included the bounds for 18 < k < 20. 
We see that the upper bound is very loose, but 9(k) equals 
the implicit lower bound 6 2 (k) for all k < 17. We conjecture 
that this may be the case for all k. Table [Vl] shows that the 
explicit lower bound 6*2 (ft) is also loose (but substantially 
better than the upper bound) and the ratio 9i(k)/9 2 (k) is 
increasing slowly with k. For k = 13 the ratio is 1.375, for 
k = 16 it is 1.485, and for k = 20 it is 1.555. Theorem M 
below shows that the ratio is always less than 2. The lower 
bound $2 (ft) has the advantage that it is explicit and that it 
shows that 9(k) grows exponentially with k. In Appendix 1 
we prove the following theorem. 

Theorem 14: Let k > 6 and R = 2 fc ~ 4 . Let ?9 = be 
the positive real number defined by 



2(4R - 2) (4tf + 1) [R + ^{R-l) 2 - 8Rfi 



MR+R-l - (40 + l)y/(R - l) 2 - 8iM 
where 



1, (49) 



8Rti + R-1- y/(R-l) 2 - 8iM 
8i?0 + 3R-1+ ^/{R - l) 2 - &M' 



(50) 



TABLE VI 
Values of and bounds on 9(k). 



k 


6 


7 


8 


9 


10 


11 


12 


13 


14 


Lower bound 02 (k) 


1 


1 


1 


2 


3 


5 


9 


16 


29 


Lower bound 9i(k) 


1 


1 


1 


2 


1 


7 


12 


22 


41 


e(k) 


1 


1 


1 


2 


1 


7 


12 


22 


41 




1 


1 


1 


2 


1 


7 


12 


22 


41 


Upper bound, Thm. [721 


2 


4 


8 


16 


32 


64 


128 


256 


512 



k 


15 


16 


17 


18 


19 


20 


Lower bound 02 (k) 


53 


99 


185 


348 


657 


1244 


Lower bound 8i (k) 


78 


147 


279 


530 


1012 


1935 


e(k) 


78 


147 


279 








r*(*)i 


78 


147 


279 


530 


1012 


1935 


Upper bound, Thm. [751 


1024 


2048 


4096 


8192 


16384 


32768 



Then 

a) 0i (ft) < r#(ft)i 

b) < 



)fc-5 



(fc-2)ln2 + ln(fc-3)-l/(2 fc - 3 -l) 2' 



c) 6»i(fc)>0(ft) 



fc-2 
k- 3 



(40(fc) + l)4~ fc . 



Combining Theorem [l4h ) and Theorem [14b). we get the 
following corollary. 
Corollary 6: We have 

O^k) = \§(k)~\ or 6»i (fc) = rtf(fe)l - l. 

Moreover, the first alternative is the most likely. 

We have included in Table [VTl For the range of val- 

ues we have computed, i.e. k < 20, we have 9\{k) = \ r d(k) 1 \. 
If the conjecture that 9{k) = 9\{k) is true, then |~0(ft)] 
is a sharp upper bound on 9(k). Further, if the conjecture 
is true, then S, h k is proper for all n > 2 2/c_6 /(fcln2), a 
substantially stronger result than what we have been able to 
show in Corollary [5] 

For k > 6, let $ fe be the number of n G [2 fc_1 +l ) 2 2fc " 6 -l] 
such that S n k is proper. 

We have shown by direct computation and the use of 
Lemma [TT] that for 6 < k < 8, S n ,k is proper for all 
n G p^- 1 + 1,2 • 2 fe - 1 ]. Hence, $ fc = 2 2fe - 6 - 2 k ^ - 1 
for 6 < k < 8. 

For 9 < k < 12 we know that there are values of n where 
S n ,k is not proper. For given t, k, m, let X t: k,m be the set of 
n€ [2 k - 1 (t + l)-2 k - r '\2 k - 1 (t + l)-2 k '-' rn - 1 } for which 
S n ,k is not proper. The set X t .k. m may be empty. In particular, 
Theorem \TTk) shows that Xt.k.m = if (38) is satisfied. On 
the other hand, Table|V]shows that Xi, k ,i ^ for 9 < k < 18. 
By direct computation and the use of Lemma [17] we have 
shown that the values given in Table I VIII are the only values 
/T, > 2 fc— 1 + 1 for which S n .k is not proper. The computations 
have been extended up to k = 17. In general, the values in 
Xt,k,m are n °t necessarily consecutive. For example 

X n ,UA = [91124, 93142] U [93181,93184]. 

This is similar to what we have in Table Hill and the underlying 
reason is the same. 
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Ranges of n > 2* 



TABLE VII 
_1 + 1 WHERE S n 



IS NOT PROPER. 



k 


m 


-*i,fc,m, 1 < t < 8{k) - 1 


y 


1 


[olio, ooUJ 


10 


1 


[588,687], [1127,1175], [1661,1667] 


11 


1 


[1125,1423], [2200,2402], [3255,3397], 






[4306,4396], [5353,5398], [6399,6401] 




2 


[1661, 1667] 


12 


1 


[2196,2903], [4301,4902], [6393,6893], 






[8500,8901], [10582,10917], [12661,12935], 






[14738,14955], [16813,16977], [18887,19000], 






[20959,21024], [23030,23050] 




2 


[3255,3397], [5353,5398] 



TABLE Vffl 
The values of <E> fe for small k. 



have shown that S, hk is proper for all n > 2 Ik ~ b — 3-2 k ~ i + 2. 
Moreover, S n f. is proper for at least 17/21 of the shorter 
lengths. A plausible conjecture (8(k) = 1 (k)) implies that 
S n .k probably is proper for all n > 2 2fc ~ 6 /(fc In 2). 

An open question for future work is the following: is it the 
case that if there is an n S [2 k - 1 t,2 k - 1 (t + l) - 1] for which 
S n ,k is not proper, then n = 2 fe ~ 3 (4£ + 1) is such an nl If 
the answer is yes (which we believe it is), then this would in 
particular imply the conjecture referred to above. 

Further work may concern searching for modifications of 
the construction that will extend the range of lengths where 
the codes are proper or satisfactory. In particular, one line of 
investigation could be to consider lengths less than 2 fe_1 by 
looking at the the duals of the best known codes of minimum 
distance 4. 



k 


6 


7 


8 


9 


10 


11 


12 




31 


191 


895 


3816 


15715 


63719 


256544 


Bound, ThmfBl 


16 


134 


683 


3023 


12677 


51880 


209866 



The conjecture that 9(k) 
follows: if 



A— 1 



02 (fc) can be reformulated as 
U |J X tXm ± 0, 

t=l m=l 

then n(k, 1) € Xi k,i- In general, we conjecture that if 
X t , k ,m ^ 0, then 2 k - 1 {t - 1) + n{k,m) € X t>k>m . 

From Table IVIII we get the explicit values of $fc given in 
Table IVIIII We have included in the table the lower bound 
given in Theorem [151 below. 

Theorem 15: When k > 6, S n> k is proper for 



17 
21 

fc-i 



55 

y 

i2fe-6 



- 11 



of the values of n € [2 fe_1 + 1,2 
The proof is given in Appendix 2. 

Comment. We clearly have $ fc > 2 fe - 1 (2 fe " 5 - 9(k)). 
The discussion on 6{k) above indicates that we may have 
9(k)/2 k ~ 5 —> 0. If this is the case, then we have 
linife^oo $ fc /2 2fe - 6 = 1. 

VIII. Summary and future work 

In this paper we have analyzed the codes S n ,k and their 
duals C n , n -k (which are shortened Hamming codes) to in- 
vestigate if they are proper or satisfactory codes for error 
detection. 

We have determined the weight distribution of the codes 
S Ut k and computed the undetected error probability. 

For n = 2 k — 2 m , the codes S n k are proper for all 1 < 
m < k — 1, However, for k greater than 8, there are values 
of n such that S n ,k and C n ,n-k are ugly (not satisfactory) 
for error detection. For increasing k, the percentage of such 
codes is increasing. On the other hand, we have shown that the 
number values n such that S n .k is proper grows exponentially 
with k. 

We have given a generalization of the construction which 
defines codes S Ut k for all lengths greater than 2 fe_1 , and we 



Let 



Appendix 1 
Proof of TheoremQj] 



P t {p) = Pue(S 2 k-l t+2 k-^ k ,p). 



The expression for P t (p) that was used to determine the bound 
6x(k) is easily obtained by combining Table U (with m = 1) 
and Lemma [161 Since R = 2 fe ~ 4 , we have: 



Pt(p) 



-2p mt {l-pf m+2R 
4p 4m+2R (l~p) 4m 



+ (16R-8)[ P {l-p)} 4Rt+R 
+ p SRt (l-p) 2R . 



(51) 



We note that Pt(p) is well defined for any positive real 
number t. For large values of t, Pt(p) is increasing on [0, 1/2]. 
For small values of t, Pt (p) is first increasing, then decreasing, 
then again increasing. There is a limiting t such that, for this t, 
there is a po such that P/(po) = and P[{p) > for all other 
p in (0, 1/2). In particular, this implies that P/'(po) = 0. In 
principle, the two equations P[(po) = and P t "(po) = can 
be used to determine po and t = 9q. However, the equations 
are complicated, and we will consider an approximation which 
is easier to handle. We remark at this point that for t close to 
R, P t "(po) = is not possible. 

In this appendix we often drop k from 6(k) and write just 9 
when the value of k should be clear from the context. Similarly 
we write ■& for #o f° r @o(k), etc. 

In Pt{p), the last three terms are increasing on [0,1/2] 
whereas the first term is increasing on [0,d/n] and decreasing 
on [d/n, 1/2]. For all p e [0,1/2], the first two terms are 
dominating. Therefore, we first consider the sum of these two 
terms: 



Mp) = 2p* Rt (l-p) 



\4Rt+2R 



+ (16R - 8) [p(l - p)] 4Rt+R 



and determine the $ and p\ such that f'${pi) = f#(pi) = 0. 
We expect i? to be a good approximation to 9q (and p\ to be 
a good approximation to po). 

The remaining two terms in P®{p) are much smaller. 
Moreover, they are increasing on [0, 1/2]. Therefore, ft > (Jo- 
in particular, 9\ = \9§ \ < \ft~\. This proves Theorem [l4h). 
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We have 

fl(p) = 8Rtp 4Rt - 1 (l- P ) im+2R 

- (8Rt + AR)p 4Rt (l - p )4ra+2fl-i 
+ (16R - 8)(ARt + R)[ P (1 - p)] 4 *^*- 1 ^ - 2p) 
= ARp im -\l-p) ARt+R - 1 

■{[2t-(At + l)p](l-p) R 

+ (4i?-2)(4t + l)(l-2p)p i? }. (52) 

Remark. We see that ft (1/2) = -AR(l/2f Rt + 2R - 1 < 
for all t. Hence, the equation f$(p) = will, in addition to 
pi, have at least one more solution in (0, 1/2). This second 
solution will be closer to 1/2. However, it does not reflect a 
property of P&{p), but only the approximation f$(p). 

Since f#(p) = and < p < 1/2, we have 

A(l -p) R + Bp R = (53) 

where 

A = 2tf - (4tf + l)p and B = (AR- 2) (40 + 1)(1 - 2p). 
Similarly, 

#(p) = 4i? p 4 ^- 2 (1 - P f m+R - 2 {C (l-p) R + Dp R ) 
where 

C = 20(4i?0 - 1)(1 - pf - 80(20 + l)Rp(l - p) 

+ (20 + l)(4i?0 + 2i?- l)p 2 , 
D = (AR — 2) (40 + 1)[(4#0 + i? - 1)(1 - 2p) 2 

-2p(l-p)], 

and so 

C(l-p) R + Dp R = 0. (54) 
Combining d53l and d54l . we get 

(AD — BC)p R = 0. 

Since p fl ^ 0, we have AD — BC = 0, and so 
AD-BC 



= 



(4R- 2)(40 + 1) 

(8i??9 + 2i? - l)p 2 - (8i?t9 + R - l)p+ 2Rd. 



Solving this for p, we get two solutions 

8i?0 + R-1- A Ri # 
2(8R& + 2R-1) [ 

8R-& + R-1 + A Ri) 



Pi 



P2 



2(8RD + 2R-1) 



where 



Ajj, t - V(/i-l) 2 -8ift. 



We see that if 8-dR > (R — l) 2 , then the roots are not real. 
This reflects the fact that f$(po) — is not possible in this 
case. 

The smaller of the two roots is the pi we are looking for, the 
larger pi occurs because we have neglected the two smallest 



terms in ( 1511 as explained above. Therefore, it is not relevant 
for our analysis of P$(p). Since f!^(p\) = 0, (|52l implies 

[2t?-(4t?+l)pi](l-pi) 7? + (4i?-2)(4i? + l)(l-2pi)pf = 0. 

(55) 

Substituting the value of p\ into (IB3T l and simplifying, we get 
<g9}. 

We cannot find a closed expression for but, for a given 
k, we can determine the value numerically. We note, however, 
that ["#] (which is the quantity we want) actually is the least 
integer t such that 

2(4i?-2)(4t+ 1)(R+A) / 8Rt + R- l- A\ R > 

4tR + R-l-(4t+l)A \8Rt + 3R—1 + AJ ~ ' 

(56) 

where A = A^ t and this observation simplifies the numeric 
determination of \$~\ since we do not have to solve the 
equation d49l l, but only search for \$~\ . 

To prove Theorem [T4~b), we first give a couple of lemmas. 

Lemma 18: If < 8Rt <(R- l) 2 , then 

8Rt + R - 1 - A 1 
8Rt + 3R - 1 + A > ~2~t 
Proof: The function 

8Rt + R - 1 - x 



8Rt + 3R-l + x 
is decreasing when x is increasing. We have 

A 2 = (R - if - 8Rt 

= (R- It) 2 -2R - m 2 + 1<(R- At) 2 
and so A < R — It. Hence, 

8Rt + R - 1 - A 8Rt + R - 1 - (R - 4i) 



8Rt + 3R - 1 + A 8Rt + 3R - 1 + (R - At) 
1 4t(4i - 1) + AR - 1 1 
~ 2t + 2t[{8R - A)t + AR - 1] > ~ 2t 



Lemma 19: If < 8Rt <(R - l) 2 , then 
2(AR-2)(At+l)(R + A) 



AtR + R—l— (4t + l)A 
Proof: We have 



> 8-R - 4. 



(At + 1)(R + A) 



> 



(At + 1)(R + A) 



AtR + R - 1 - (At + 1)A AtR + R- (At + 1)A 

R + A 



R 



> 1. 



(57) 



(58) 



From Lemmas Q~8] and [19] we see that if 

then dS6b is satisfied and so t > Taking logarithms of 
we get the equivalent expression 

ln(8i?) + ln(l-^)+i ? ln(l-l)>0. 

Since ln(l — x) > —xj (1 — x) for x e (0, 1), we see that if 
1 R 



\vl(8R) 



2R-1 2t-l 



0, 



(59) 
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then (158) is satisfied, and so i > i9. 

Since i? = 2 fc ~ 4 , we have ln(8i?) = (fc - 1) In 2. Solving 
for t, we get the following relation: 

R 1 



By Theorem [14b) 

R(k-3) 



2(fc-l)ln2-2/(2E-l) 2' 



(60) 



2(fc- + fc — 2 



> 



i?(fc - 3) 



In the proof of Lemma [T9l we used that A > 0. However, 
using d60) we get a better bound on A and hence a stronger 
version of Lemma [T9l and a better bound on 

Lemma 20: If 

R 1 



(fc-1) 
> fcln2 



R 



(fc-2)ln2+ln(fc-3)-l/(2 fe - 3 -l) 



2fc-3 



for fc > 23, and so -k r < 



R ^ „-fcln2 _ o-fc 



2 . Direct computations 



< t < 



2(fc-l)ln2-2/(2i?-l) 2' 



(61) 



then 



2(4i? — 2)(4t + 1)(-R + A) 
4tR+R- 1 - (4f + l)A 

Proo/v By ( |61~b 
A 2 ^ = (iZ - l) 2 - 8itt 
> (i?- I) 2 - 8R 



> (4E-2)(fc-3). 



1? 



8i? 



> if 1 - 



2(fc-l)ln2-2/(2.R-l) 2 
4 \2 

(62) 



fc-1 



for k > 11. Hence, 



fl + A J? + fl ( 1 ~fc A r) fc-3 



R - A 



Therefore, if 

^(l-^)(fc-3)(l-l) fl >l, 

then t > -d. Taking logarithms and solving as above, we get 
Theorem [14b) exactly as we obtained ( f60b from ((58). 

For 6 < fc < 10, we can show that Theorem [14b) is true by 
direct computation. 

To prove Theorem [T4t). we first give another lemma. 

Lemma 21: 

fc — 3 

a) For fc > 6 we have tt < 1 ; r — — ; . 

~ 2(fc - l)tf(fe) + (fc - 2) 

b) For fc > 11 we have tt r < 2~ k . 



Proof: Using ( 162) . we get 

8Rtf + R - 1 - A R # 



< 



< 



8RV + 3R- l + A Rt # 

8Rtf + R- Arj 
8Rtf + 3R + Ar~o 

8ift? + fl-fl[l-4/(fc- 1)] 
8i?i9 + 3 J R + i?[l-4/(fc-l)] 
fc-3 

1 



2(fc - + (fc - 2) 
This is Lemma |2"TV). Moreover, this implies that 



n R < g 2(fc-l)iS + fc-2 



show that 71"" < 2~ fc also for 11 < fc < 22. Hence Lemma 
l2~Tb ) is proved. ■ 
To prove Theorem fT4t). let e > and t > e. We get 

ft-M 

= 8R(t - e)p 4 «(*- e )-l(l _ p )4ii(t-e)+2fl 

- [8i?(t - e) + 4i?]p 4ii (*- e )(l _p)4R(*-*)+2«-i 
+ (16i?-8)[4i?(i-e) + i?] 

. [p(l-p)] 4 «(*- £ )+«- 1 (l-2p) 

= fl( P ) - Sifep^'-^-^l _p)«(*-e)+2i? 

+ 8Re P iR ^\l-p) iB ^ +2R - 1 

- 4ife(16i? - 8)[p(l - p )]4*(t-=)+fl-i(i _ 2p ). 

Since f#( P i) = and (1 — 2pi)/(l — pi) = 1 — 7T, we get 

ti-M = -8 J R£(l-pi) 8fl( "- e)+2i? - 1 7r 4 ^- £ )- 1 
•(l-7r)[l + (8i?-4) 7 r' R ]. 

Let 

g t (p) =4 P im + 2R (l - p) 4 ^ + p* Rt (l - p) 2 «. 

Then P t (p) = f t ip) + g t (p). We get 

gt,_M = (1 _ Pl) ^- £ )+2«-i^- E )-i 

• { [16jR(0 - e) + 8iJ]7r 2ii - 16fl(0 - e)7r 2fl+1 

+ 8i2(i9 - eJtt^*- 6 ) - 2i?7r 4 ^- e ) +1 } 
< 8^(1 - pi fW-e)+™-\4W-e)-i 
•{[2^(l-7r) + l]7r 2i? +^7r 4i? ^}. 
Clearly, P^_ £ { Pl ) = f' d _ £ ( Pl ) + S*- e (Pi) < if 

[2i9(l-7r) + l]7r 2ii <e(l-7r), (63) 

and 



< £(1 _ n ^ 8R _ 4 y 



Equation (1631 is equivalent to 

2*- 1 



tt 2 * < e. 



1 -TT, 

We choose the e which gives equality in ((65), that is 

1 



e= 20 



r 2_R 



(64) 



(65) 



(66) 



1 — 7T 

Equation d64) is equivalent to 

1 ^(f-^-Sfl < 6 (8i? _ 4) _ (67) 
1 — 7T 7H rt 
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By ([55]), 



1 £ 

< 



1 — 7T ~K 2R 

Further, by Theorem [T4"b). 

< (8-R-4), 
and, finally, 7r < 1 and so 

7r 4i?(,3-e)-3i? < j 

Hence, (|64| | is also satisfied. Therefore 

X >#-£, 
where £ is given by (l66*l l. By Lemma |2T1 



(68) 



20 



< 20- 



1 



1 -7T 



2K 



2(fc- 1)^ + /c — 2 



fc-3 



fc-2 
fc-3 



(4tf+l)4 



-fc 



(69) 



Combining d68l i and d69l we get Theorem fl~4b) for fc > 11. 
Direct computations show that it is true also for 6 < fc < 10. 
This completes the proof of Theorem [T4l 

Appendix II 
Proof of Theorem[T31 

Let &t,k,m be the number of n in 

[2* _1 (t + 1) - 2 k - m + 1, 2 fe - 1 (i + 1) - 2 k - m - 1 } 
such that S^fc is proper. Clearly, 

2 fe_5 -l fc-1 

$ fc = E E $ *^™- ( 70 > 

t=l m=l 

Lemma 22: Let fc > 6 and 1 < m < k — 1. 

a) If 1 < m < |_nr J and 1 < t < 2 fe - 2TO " 3 - 1, then 

fc + l r- 

$t,k,m > 2 2 Vi- 1. 

b) If 1 < 771 < and t > 2 fc - 2 ™- 3 , then 

/f. m — 1 

^t,k,m = * 

c) If + 1 < m < fc - 1 and t > 1, then 

,K rjfc — m — 1 

<&t,k,m = 1 

Proof: By Theorem [TOl 

*t,fc,m > \jt,k,m\ + LXt,fe, m J + 1. (71) 

In the proof of Theorem QT| we showed that if 

fc-3 



l<TO<fc-lor£>2 



fc-2m-3 



then 



Now, consider 1 < m < |_^J and 1 < t < 2 k ~ 2m - 3 - 1. 
Then 2 k ~ m+2 < 2 k+1 and so 



\j t,k,m\ > T t ,k,m — 1 = 



-1 + ^/1 + 2 k + 1 (t + 1) - 2 k ~ m + 2 



> 



-i + V¥+H 



Similarly, 



UJ.k.mJ > 



-3 + \/2 fc +^ 



Combining these two inequalities with ( |71~1 ). a) follows. 

Lemma 23: For fc > 6 we have 

E E ^ k ^ 



m=l t=l 



9 2fc-3-3| ^2 I 

2fc-6 ofc-1 



21 ' 

2^-3 _ 2*:-3-2L^i^J 



Proof: First we see that 

2 *-2m-3_ 1 

E Vi = -V2 



21 

fc-3 



+ 2 fc - 1 -L ii 2 1 J 



fc-2ro-3 



t=l 



E ^ 

t=i 







_2~ 2 m _(_ _ (2^ — 2m— 3 j 2 



Hence 

E E 2 ^ 

m=l t=l 

L ^ J /2 

rn — 1 

= A(2 2fe - 4 -2 2fe - 4 - 3 L^J) - (V- 1 - 2 ' £ - 1 -L i ^J 

Similarly, 

^ ^ 1 ~ 3 

m=i t=i 

The lemma follows from these results and Lemma l22h). 
Lemma 24: For fc > 6 we have 

k-l 2 fc ~ 5 -l 

E E ^ k , m 

m= L^3j+l t = l 

= 2 2fc - 6 -L^J - 2 k ~ 5 - 2 A; " 1 "L i ^J + 1 



fc-3 



and 



r t ,k,m =Lt.k, m + 1 = 2 
This proves b) and c). 



fc-TO-2 



L^J 2^-1 

E E 

m=l (= 2t-2ra-3 



2 4 3 I ^ ~~ ^ I 
3 _ 2 2fc-6 _ 2 2fc-6-L^j , £ 



7 



7 
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Proof: The result follows directly from Lemmas l22b ) and 
l22b ) respectively. ■ 
We can now combine these results into a proof of Theorem 
M 

Proof: From d70b and Lemmas [2214241 we get 

L^J 2 k - 2m - 3 -i 

= E E $ *. fc .™ 

m=l t=l 

fc-1 2 fc " 5 -l 

+ E E 

m=L^J+l 
Lt^J 2 fc - 5 -l 

+ E E 

m=l t =2 k - 2 ™- 3 

> 2 2fe ~ 6 ~L i ^J - 2 fc ~ 5 - 2 fc_1_ L i T Ji J + 1 

3 . 2fe _ 6 _ 2fc _ 6 _ L A=3j 2^ 

7 7 

2^. ^ 3I _ _ _ I 

+ -•2 -2 + 2 



2 /v 







)+ 


2 



_ £^ 2 2fe ^ 6 — — 2 k ~ 5 
21 ' 3 

+ 21 + 3 

Theorem Q3] follows from this expression. 



fc-3 
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